Search CORE

138 research outputs found

Sparsest factor analysis for clustering variables: a matrix decomposition approach

Author: A Stegeman
AJ Izenman
BS Everitt
C Spearman
CC Aggarwal
D Knowles
DM Zou
G Gan
GAF Seber
HH Harman
IT Jolliffe
J de Leeuw
JMF ten Berge
JMF ten Berge
K Adachi
K Adachi
K Adachi
K Hirose
K Hirose
Kohei Adachi
L Eldén
LR Goldberg
M Rattray
M Vichi
MJ Zaki
Nickolay T. Trendafilov
Nickolay T. Trendafilov
NT Trendafilov
NT Trendafilov
PT Costa
R Mazumder
R Reyment
S Unkel
SA Mulaik
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/04/2017
Field of study

We propose a new procedure for sparse factor analysis (FA) such that each variable loads only one common factor. Thus, the loading matrix has a single nonzero element in each row and zeros elsewhere. Such a loading matrix is the sparsest possible for certain number of variables and common factors. For this reason, the proposed method is named sparsest FA (SSFA). It may also be called FA-based variable clustering, since the variables loading the same common factor can be classified into a cluster. In SSFA, all model parts of FA (common factors, their correlations, loadings, unique factors, and unique variances) are treated as fixed unknown parameter matrices and their least squares function is minimized through specific data matrix decomposition. A useful feature of the algorithm is that the matrix of common factor scores is re-parameterized using QR decomposition in order to efficiently estimate factor correlations. A simulation study shows that the proposed procedure can exactly identify the true sparsest models. Real data examples demonstrate the usefulness of the variable clustering performed by SSFA

Crossref

Open Research Online (The Open University)

CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion

Author: Adam J. Reid
AJ Enright
AJ Enright
Andrew B. Clegg
B Snel
C von Mering
C Yeats
Christine A. Orengo
CJ Marcotte
DE Barnes
EM Marcotte
F Bellivier
G Apic
I Yanai
Juan A. G. Ranea
K Truong
M Huynen
Magnus Rattray
P Resnik
PM Bowers
PW Lord
RD Finn
RD Finn
S Hoffman
SF Altschul
SK Kummerfeld
TF Smith
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Background: In order to understand how biological systems function it is necessary to determine the interactions and associations between proteins. Gene fusion prediction is one approach to detection of such functional relationships. Its use is however known to be problematic in higher eukaryotic genomes due to the presence of large homologous domain families. Here we introduce CODA (Co-Occurrence of Domains Analysis), a method to predict functional associations based on the gene fusion idiom.Methodology/Principal Findings: We apply a novel scoring scheme which takes account of the genome-specific size of homologous domain families involved in fusion to improve accuracy in predicting functional associations. We show that CODA is able to accurately predict functional similarities in human with comparison to state-of-the-art methods and show that different methods can be complementary. CODA is used to produce evidence that a currently uncharacterised human protein may be involved in pathways related to depression and that another is involved in DNA replication.Conclusions/Significance: The relative performance of different gene fusion methodologies has not previously been explored. We find that they are largely complementary, with different methods being more or less appropriate in different genomes. Our method is the only one currently available for download and can be run on an arbitrary dataset by the user. The CODA software and datasets are freely available from ftp://ftp.biochem.ucl.ac.uk/pub/gene3d_data/v6.1.0/CODA/. Predictions are also available via web services from http://funcnet.eu/

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

UCL Discovery

Can sacrificial feeding areas protect aquatic plants from herbivore grazing? Using behavioural ecology to inform wildlife management

Author: A Jozkowicz
A Sih
AJ McLane
AK Pandit
BA Nolet
BA Nolet
C Bech
CD Ankney
CJ Spray
D van Vuren
DJ Decker
EC Rees
Francis Daunt
G Gayet
G Perry
GN Robb
GV Watola
H Blokpoel
HV McKay
I Gordon
J Sahlsten
JA Estes
JA Vickery
JA Vickery
JE Gross
JV López-Bao
KA Wood
KA Wood
KA Wood
KA Wood
KA Wood
KA Wood
KA Wood
Kevin A. Wood
KH Hodder
KM Ringelman
KS Tatu
LM Gosling
LP Hansen
M Kersten
M Owen
M Owen
M Owen
Matthew T. O’Hare
Maura (Gee) Geraldine Chapman
MJ Heydon
MR Conover
MR van Eerden
MT O’Hare
PV Rattray
RA Stillman
RA Stillman
Richard A. Stillman
RJ Greenwood
RJ Orr
S Boutin
S Takatsuki
SM Cooper
SM Percival
SM Redpath
SM Redpath
T Amano
T Amano
T Amano
TE Martin
W Meissner
WJ Sutherland
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 31/07/2014
Field of study

Effective wildlife management is needed for conservation, economic and human well-being objectives. However, traditional population control methods are frequently ineffective, unpopular with stakeholders, may affect non-target species, and can be both expensive and impractical to implement. New methods which address these issues and offer effective wildlife management are required. We used an individual-based model to predict the efficacy of a sacrificial feeding area in preventing grazing damage by mute swans (Cygnus olor) to adjacent river vegetation of high conservation and economic value. The accuracy of model predictions was assessed by a comparison with observed field data, whilst prediction robustness was evaluated using a sensitivity analysis. We used repeated simulations to evaluate how the efficacy of the sacrificial feeding area was regulated by (i) food quantity, (ii) food quality, and (iii) the functional response of the forager. Our model gave accurate predictions of aquatic plant biomass, carrying capacity, swan mortality, swan foraging effort, and river use. Our model predicted that increased sacrificial feeding area food quantity and quality would prevent the depletion of aquatic plant biomass by swans. When the functional response for vegetation in the sacrificial feeding area was increased, the food quantity and quality in the sacrificial feeding area required to protect adjacent aquatic plants were reduced. Our study demonstrates how the insights of behavioural ecology can be used to inform wildlife management. The principles that underpin our model predictions are likely to be valid across a range of different resource-consumer interactions, emphasising the generality of our approach to the evaluation of strategies for resolving wildlife management problems

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Bournemouth University Research Online

NERC Open Research Archive

Distinguishing Asthma Phenotypes Using Machine Learning Approaches.

Author: A Custovic
A Custovic
A Fraser
A Høst
A Pickles
A Simpson
A Wijga
Adnan Custovic
AJ Lowe
AV Berg
B Clarisse
BD Spycher
BD Spycher
BD Spycher
BG Toelle
BL Jones
BL Jones
C-M Chen
CA Figueiredo
CE Kuehni
CJ Lodge
CL Storr
D Barber
D Belgrave
D Caudri
D Nagin
DA Linzer
DC Belgrave
DC Belgrave
DCM Belgrave
DCM Belgrave
F Kauffmann
F Kauffmann
FD Martinez
FL Garden
FP Perera
G Bochenek
G Weinmayr
GB Marks
GP Anderson
J Hagenaars
J Henderson
J Lotvall
J Magidson
J Sunyer
J Winn
JA Smith
JK Vermunt
K Burnham
KE Wonderen Van
KL Nylund
L García-Marcos Álvarez
L Hunt
L Lowe
L Panico
LA Lowe
M Depner
M Herr
M Scott
Magnus Rattray
Mattia Prosperi
MJ Ege
ML Barreto
MM Hagendorens
MW Pijnenburg
N Lazic
NC Nicolaou
NG Papadopoulos
OE Savenije
P Burney
P Haldar
P Rzehak
P Rzehak
PD Sly
Q Chen
Q Vuong
Rebecca Howard
RJP Valk van der
RL Bergmann
RL Miller
RO Crapo
RT Stein
S American Thoracic
S Havstad
S Mihrshahi
S Rabe-Hesketh
S Stanojevic
SE Wenzel
SK Weiland
ST Lanza
ST Lanza
T Jung
T Minka
The European Community Respiratory Health Survey
V Siroux
WC Moore
X Robin
Y Lo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Asthma is not a single disease, but an umbrella term for a number of distinct diseases, each of which are caused by a distinct underlying pathophysiological mechanism. These discrete disease entities are often labelled as asthma endotypes. The discovery of different asthma subtypes has moved from subjective approaches in which putative phenotypes are assigned by experts to data-driven ones which incorporate machine learning. This review focuses on the methodological developments of one such machine learning technique-latent class analysis-and how it has contributed to distinguishing asthma and wheezing subtypes in childhood. It also gives a clinical perspective, presenting the findings of studies from the past 5 years that used this approach. The identification of true asthma endotypes may be a crucial step towards understanding their distinct pathophysiological mechanisms, which could ultimately lead to more precise prevention strategies, identification of novel therapeutic targets and the development of effective personalized therapies

Crossref

Springer - Publisher Connector

PubMed Central

Spiral - Imperial College Digital Repository

The University of Manchester - Institutional Repository

Genomic analysis of the function of the transcription factor gata3 during development of the Mammalian inner ear

Author: A Blokzijl
A Chiloeches
A Karis
A Oshima
A Verri
Adam Kneebone
AJ Nicholl
AR Conery
AY Xiao
B Fritzsch
BD Manning
C Niehrs
CKI Williams
Claire Johnson
CM Hurvich
D Davies
D Kurek
Daniela Cacciabue-Rivolta
DJC MacKay
DM Fekete
DM Fekete
DP Brazil
DR Alessi
DW Powell
ER Isenovic
G Camarero
G Camarero
G Lawoko-Kerali
G Lawoko-Kerali
G Lawoko-Kerali
G Schwartz
GM Findlay
Grace Lawoko-Kerali
H Lowenheim
H Mi
H Ohuchi
H Van Esch
H Van Esch
HA Seong
Hikke Van Doorninck
HS Kim
I Tachibana
I Varela-Nieto
J Chen
J Nardelli
J van der Wees
JM Kornhauser
JP Liu
JP Liu
K Ikeda
K Lillevali
K Lillevali
KM Murphy
KW Wirtz
KW Wirtz
M Hucka
M Levine
M Milo
M Rattray
Mahesan Niranjan
Marcelo Rivolta
Marta Milo
Matthew Holley
MB Eisen
MM Marelli
MN Rivolta
MN Rivolta
MN Rivolta
MW Kelley
N Aoki
NC Andrews
O Fromigue
P Chen
PP Pandolfi
R Benetti
R Helyer
RC Diaz
RD Pearson
RK Patient
RW Hendriks
S Abe
S Fujimoto
S Yang
SH Um
SP Yu
T Endo
T Sekimoto
T Sugatani
TF Franke
Thomas A. Reh
V Janzen
VM Smith
X Liu
X Shen
X Zhou
XD Peng
XQ Wang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/09/2009
Field of study

We have studied the function of the zinc finger transcription factor gata3 in auditory system development by analysing temporal profiles of gene expression during differentiation of conditionally immortal cell lines derived to model specific auditory cell types and developmental stages. We tested and applied a novel probabilistic method called the gamma Model for Oligonucleotide Signals to analyse hybridization signals from Affymetrix oligonucleotide arrays. Expression levels estimated by this method correlated closely (p<0.0001) across a 10-fold range with those measured by quantitative RT-PCR for a sample of 61 different genes. In an unbiased list of 26 genes whose temporal profiles clustered most closely with that of gata3 in all cell lines, 10 were linked to Insulin-like Growth Factor signalling, including the serine/threonine kinase Akt/PKB. Knock-down of gata3 in vitro was associated with a decrease in expression of genes linked to IGF-signalling, including IGF1, IGF2 and several IGF-binding proteins. It also led to a small decrease in protein levels of the serine-threonine kinase Akt2/PKB beta, a dramatic increase in Akt1/PKB alpha protein and relocation of Akt1/PKB alpha from the nucleus to the cytoplasm. The cyclin-dependent kinase inhibitor p27(kip1), a known target of PKB/Akt, simultaneously decreased. In heterozygous gata3 null mice the expression of gata3 correlated with high levels of activated Akt/PKB. This functional relationship could explain the diverse function of gata3 during development, the hearing loss associated with gata3 heterozygous null mice and the broader symptoms of human patients with Hearing-Deafness-Renal anomaly syndrome

Public Library of Science (PLOS)

Southampton (e-Prints Soton)

Crossref

Directory of Open Access Journals

PubMed Central

Erasmus University Digital Repository

White Rose Research Online

CDK targets Sae2 to control DNA-end resection and homologous recombination

Author: A Penkner
AA Sartori
AC Bishop
AH McKee
AJ Rattray
Alessandro A. Sartori
Andrés Aguilera
BM Lengsfeld
C Uanschou
E Baroni
E Karathanasis
F Cortes-Ledesma
F Esashi
F Lazzaro
Felipe Cortés-Ledesma
G Ira
J Chen
JM Hinz
K Lee
KS Lobachev
M Clerici
M Lisby
M Shrivastav
MD Mendenhall
MJ Neale
N Sugawara
O Limbo
O Puig
Pablo Huertas
S Prinz
SJ Boulton
Stephen P. Jackson
T Caspari
Y Aylon
Y Aylon
Y Pommier
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

DNA double-strand breaks (DSBs) are repaired by two principal mechanisms: non-homologous end-joining (NHEJ) and homologous recombination (HR)1. HR is the most accurate DSB repair mechanism but is generally restricted to the S and G2 phases of the cell cycle, when DNA has been replicated and a sister chromatid is available as a repair template2-5. By contrast, NHEJ operates throughout the cell cycle but assumes most importance in G1 (refs 4, 6). The choice between repair pathways is governed by cyclin-dependent protein kinases (CDKs)2,3,5,7, with a major site of control being at the level of DSB resection, an event that is necessary for HR but not NHEJ, and which takes place most effectively in S and G2 (refs 2, 5). Here we establish that cell-cycle control of DSB resection in Saccharomyces cerevisiae results from the phosphorylation by CDK of an evolutionarily conserved motif in the Sae2 protein. We show that mutating Ser 267 of Sae2 to a non-phosphorylatable residue causes phenotypes comparable to those of a sae2Δ null mutant, including hypersensitivity to camptothecin, defective sporulation, reduced hairpin-induced recombination, severely impaired DNA-end processing and faulty assembly and disassembly of HR factors. Furthermore, a Sae2 mutation that mimics constitutive Ser 267 phosphorylation complements these phenotypes and overcomes the necessity of CDK activity for DSB resection. The Sae2 mutations also cause cell-cycle-stage specific hypersensitivity to DNA damage and affect the balance between HR and NHEJ. These findings therefore provide a mechanistic basis for cell-cycle control of DSB repair and highlight the importance of regulating DSB resection

Crossref

PubMed Central

idUS. Depósito de Investigación Universidad de Sevilla

Competition between Replicative and Translesion Polymerases during Homologous Recombination Repair in Drosophila

Author: A Witsell
AJ Rattray
AM Holmes
AR Lehmann
BS Plosky
C Guo
C Richardson
CE Smith
CE Smith
Daniel P. Kane
DM Johnson-Schlitz
E Johansson
E Sonoda
FC Gray
FH de Groote
GB Gloor
JN Kosarek
JR Lydeard
JR Lydeard
K Takata
KJ Gerik
L Maloisel
LS Waters
M Kohzaki
M McVey
M McVey
MD Adams
Michael Shusterman
Mitch McVey
MJ McIlwraith
MJ McIlwraith
MR Lieber
N Acharya
N Nassif
PE Gibbs
PL Andersen
R. Scott Hawley
S Sharma
SD McCulloch
SL Holbeck
T Ishikawa
T Kawamoto
T Ogi
TV Ho
WD Heyer
WM Hicks
X Li
Y Hirano
Yikang Rong
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

In metazoans, the mechanism by which DNA is synthesized during homologous recombination repair of double-strand breaks is poorly understood. Specifically, the identities of the polymerase(s) that carry out repair synthesis and how they are recruited to repair sites are unclear. Here, we have investigated the roles of several different polymerases during homologous recombination repair in Drosophila melanogaster. Using a gap repair assay, we found that homologous recombination is impaired in Drosophila lacking DNA polymerase zeta and, to a lesser extent, polymerase eta. In addition, the Pol32 protein, part of the polymerase delta complex, is needed for repair requiring extensive synthesis. Loss of Rev1, which interacts with multiple translesion polymerases, results in increased synthesis during gap repair. Together, our findings support a model in which translesion polymerases and the polymerase delta complex compete during homologous recombination repair. In addition, they establish Rev1 as a crucial factor that regulates the extent of repair synthesis

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

The distribution of inverted repeat sequences in the Saccharomyces cerevisiae genome

Although a variety of possible functions have been proposed for inverted repeat sequences (IRs), it is not known which of them might occur in vivo. We investigate this question by assessing the distributions and properties of IRs in the Saccharomyces cerevisiae (SC) genome. Using the IRFinder algorithm we detect 100,514 IRs having copy length greater than 6 bp and spacer length less than 77 bp. To assess statistical significance we also determine the IR distributions in two types of randomization of the S. cerevisiae genome. We find that the S. cerevisiae genome is significantly enriched in IRs relative to random. The S. cerevisiae IRs are significantly longer and contain fewer imperfections than those from the randomized genomes, suggesting that processes to lengthen and/or correct errors in IRs may be operative in vivo. The S. cerevisiae IRs are highly clustered in intergenic regions, while their occurrence in coding sequences is consistent with random. Clustering is stronger in the 3′ flanks of genes than in their 5′ flanks. However, the S. cerevisiae genome is not enriched in those IRs that would extrude cruciforms, suggesting that this is not a common event. Various explanations for these results are considered

Crossref

Boston University Institutional Repository (OpenBU)

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Are We Predicting the Actual or Apparent Distribution of Temperate Marine Fishes?

Planning for resilience is the focus of many marine conservation programs and initiatives. These efforts aim to inform conservation strategies for marine regions to ensure they have inbuilt capacity to retain biological diversity and ecological function in the face of global environmental change – particularly changes in climate and resource exploitation. In the absence of direct biological and ecological information for many marine species, scientists are increasingly using spatially-explicit, predictive-modeling approaches. Through the improved access to multibeam sonar and underwater video technology these models provide spatial predictions of the most suitable regions for an organism at resolutions previously not possible. However, sensible-looking, well-performing models can provide very different predictions of distribution depending on which occurrence dataset is used. To examine this, we construct species distribution models for nine temperate marine sedentary fishes for a 25.7 km2 study region off the coast of southeastern Australia. We use generalized linear model (GLM), generalized additive model (GAM) and maximum entropy (MAXENT) to build models based on co-located occurrence datasets derived from two underwater video methods (i.e. baited and towed video) and fine-scale multibeam sonar based seafloor habitat variables. Overall, this study found that the choice of modeling approach did not considerably influence the prediction of distributions based on the same occurrence dataset. However, greater dissimilarity between model predictions was observed across the nine fish taxa when the two occurrence datasets were compared (relative to models based on the same dataset). Based on these results it is difficult to draw any general trends in regards to which video method provides more reliable occurrence datasets. Nonetheless, we suggest predictions reflecting the species apparent distribution (i.e. a combination of species distribution and the probability of detecting it). Consequently, we also encourage researchers and marine managers to carefully interpret model predictions

CiteSeerX

Public Library of Science (PLOS)

Deakin Research Online

Crossref

Directory of Open Access Journals

PubMed Central

espace@Curtin